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Final  Report  ( ARO-D A AH04-95- 1-0665)  Some  Topics  in 
Data  Analysis  and  Stochastic  Modeling 

(4)  Statement  of  the  problem  studied. 

As  stated  in  the  Abstract  of  the  original  proposal:  “The  analysis  of  data  absent  some  consideration  for 
the  models  of  the  systems  which  generated  them  frequently  causes  the  analysis  to  be  less  effective  than  it 
could  be.  Much  of  the  work  in  nonparametric  function  estimation  has  been  content  to  stop  at  the  level,  say, 
of  a  nonparametric  regression  curve  which  gives  little  clue  as  to  the  underlying  processes.  We  propose  to 
continue  a  line  of  research  in  which  model  building  and  data  analysis  proceed  hand  in  hand.” 

The  purpose  of  this  research  has  been  to  develop  methodologies  where  models  are  developed  and  tested 
in  the  light  of  data.  We  have  used  a  number  of  settings  to  develop  this  work. 


(5)  Summary  of  the  most  important  results. 

The  book  Simulation:  A  Modeler’s  Approach  [9]  embodies  a  general  theory  for  model  based  simulation. 

We  have  proposed  a  Bayesian-Paretan  quality  optimization  model  for  use  in  NASA’s  Space  Station  project 
[5].  This  is  a  prospective  model,  as  present,  since  the  Space  Station  is  yet  to  be  constructed.  We  have  also 
developed  models  for  dealing  with  quality  improvement  via  statistical  process  control  in  situations  where  the 
output  data  to  be  examined  is  of  high  dimensionality  [1—2]. 

We  have  also  noted  discrepancies  between  the  efficient  market  theory  models  of  neoclassical  economics 
and  the  reality  of  the  market  [7,  9,  13,  14].  Management  Science  frequently  overlooks  the  fact  that  the 
market  is  based  on  the  production  of  real  goods  and  services,  which  production  should  be  constantly  under 
improvement.  We  have  developed  an  integrated  approach  in  which  entrepreneurship,  finance  and  statistical 
process  control  are  employed,  [7,  9,  13,  14].  We  have  worked  and  are  working,  for  example,  to  improve  the 
efficient  market  model  of  Black,  Scholes  and  Merton — which  model  has  produced  poor  results,  for  example, 
the  failure  of  LCTM.  The  modeling  of  economic  phenomena  is  one  of  those  which  is  made  much  more 
tractable  by  simulation  based  techniques.  We  have  developed  [3]  forecasting  models  for  natural  gas  prices 
which  outperform  the  algorithms  currently  in  use.. 

We  have  used  the  AIDS  data  base  of  the  WHO  AIDS  Working  Group  to  determine  the  driving  force 
of  AIDS  in  the  First  World.  Our  conclusion,  based  on  the  data,  is  that  it  is  the  massive  United  States 
AIDS  infective  pool  which,  due  to  the  relative  cheapness  of  international  travel,  drives  the  AIDS  epidemic  in 
Canada  and  Europe  [6,  8,  9],  Also,  we  examined  the  danger  of  the  immunosuppression  of  HIV  in  facilitating 
an  adjuvant  tuberculosis  epidemic  [10—12]. 

We  have  shown  how  the  problem  of  nonparametric  data  analysis  with  higher  dimensional  data  can  be 
structured  by  first  finding  centers  of  high  density  and  then  using  Gaussian  approximations  locally.  We 
have  observed  that  the  “curse  of  dimensionality”  is  frequently  not  real  when  one  uses  the  mode-finding 
preprocessing  step.  We  have  used  mode  finding  approaches  to  investigate  Army  ballistics  data  of  high 
dimensionality  and  found  that  one  can  perform  very  well  in,  say,  eight  space  with  samples  in  the  few 
hundreds  range  [9,  Chapter  11]. 

We  have  demonstrated  how  the  SIMEST  [4,  9]  algorithm  of  Thompson  can  be  used  in  practical  situations 
to  estimate  the  parameters  of  deep  models  using  simulation-based  techniques. 
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